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Abstract 

After differential screening of a cDNA library corresponding to genes expressed during the early stages of somatic 
embryogenesis in leaf tissue from the Cichorium hybrid '474' (C intybus L., var. sativumXC endivia L., var. latifolia) a 
nonsymbiotic hemoglobin cDNA was obtained. Studies of the expression of the gene corresponding to this clone by 
Northern blot analysis suggest that in Cichorium a nonsymbiotic hemoglobin gene is specifically expressed under somatic 
embryogenesis-inducing conditions, and that its expression is not related to stress caused by wounding or tissue culture 
conditions. © 1998 Elsevier Science B.V. All rights reserved. 
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In higher plants two families of hemoglobins are 
distinguished; the symbiotic and nonsymbiotic he- 
moglobins [1,2]. Symbiotic hemoglobins, or leghemo- 
globins, are only detected in nitrogen-fixing nodules 
formed upon the interaction between bacteria and its 
host plants, whereas nonsymbiotic hemoglobins are 
more widely distributed and are present in both 
monocot and dicot plants. In nitrogen-fixing nodules 
the function of symbiotic hemoglobins is to transport 
oxygen to the bacteroids. At the low oxygen concen- 
tration in infected cells, to avoid damage to the oxy- 
gen-sensitive nitrogenase, leghemoglobins ensure an 
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adequate supply of oxygen for bacterial respiration 
[3]. The function of nonsymbiotic hemoglobins is not 
known, but since their encoding genes are specifically 
expressed in tissue were the oxygen supply is limited 
or that are metabolically very active [1,4-7], they too 
may act as oxygen carriers. 

Here we report the identification of a nonsymbi- 
otic hemoglobin cDNA obtained after differential 
screening of a cDNA library corresponding to genes 
expressed during the induction of somatic embryo- 
genesis in leaf tissue from the Cichorium hybrid '474' 
(C intybus L., var. sativumXC. endivia L., var. lati- 
folia). 

To induce somatic embryogenesis, leaf fragments 
from 6 weeks-old plantlets were cultured in darkness 
for 5 days at 35°C in 50 ml of an agitated (50 rpm) 
liquid basal Ml 7 medium [8], supplemented with 60 
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mM sucrose and 330 mM glycerol- The presence of Clones obtained after differential screening of the 

glycerol delayed the first divisions of embryogenic library with radioactive cDNA probes corresponding 

cells and allowed a relative synchronisation of the to mRNAs isolated from leaf tissue cultured for 

first divisions after transferring the leaf fragments 3 days in embryo-inducing medium and to mRNAs 

to the same medium without glycerol [9]. Poly(A+ ) isolated from non-induced leaf tissue, were subjected 

RNA (1 |ig) isolated from 3 days-old induced leaf to two more rounds of differential screening before 

tissue was used to construct a cDNA library in phage further characterisation. 

lambda Uni-ZAP XR from Stratagene (La Jolla, Comparison of the nucleotide sequences . of the 

CA). Escherichia coli XL-1 blue and SOLR (Strata- clones obtained after differential screening with se- 

gene, La Jolla, CA) were used as hosts for lambda quences in the EMBL GenBank revealed that one 

Uni-ZAP XR and pBluescript plasmids, respectively. of them - CHI3206 - is highly similar to plant non- 
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Fig. 1 . Nucleotide and deduced amino acid sequence of the cDNA clone CH 13206 from the Cichorium hybrid *474\ The putative start 
and stop codons and the sequence corresponding to eukaryotic polyadenylation consensus sequence are underlined. Both strands of 
plasmid DNA of clone CHI3206 were sequenced between the T3 and T7 promoter sequences flanking the pBluescript cloning site. 
The DNA fragments were fluorescently labelled using the Prism AmpliTaq FS Dye Primer kit (PE Applied Biosystems, Foster City, 
CA) and the sequence determined using a LI-COR Long Read IR DNA sequencer in combination with the Base Image IR L-4000-05 
software (LI-COR, Lincoln, NE). 
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Fig. 2. Alignment of the predicted amino acid sequence encoded by clone CH 13206 and five hemoglobins from different plant species. 
Sequences were obtained from the GenBank database accession numbers U94999 (Arabidopsis thaliana), X77695 (Casuarina glauca), 
Z54157 (Vicia faba), ABO 1831 (Pisum sativum), X54089 (Medicago sativa) and U47143 (Glycine max). Alignment of sequences was 
done using CLUSTALW at the Network Protein Sequence ©nalysis (IBCP, France). 



symbiotic hemoglobins. The nucleotide and deduced 
amino acid sequences of clone CHI3206 are shown in 
Fig. 1. The nucleotide sequence of the 616 bp re- 
vealed an open reading frame of 486 bp and 14 bp 
and 102 bp 5' and 3' flanking regions. The CHI3206 
deduced protein is 161 amino acids long, including 
the N-terminal methionine and has a calculated mo- 
lecular mass of 18024 Da and a pi of 5.35. The 
predicted protein contains no signal peptide and no 
putative N-linked glycosylation sites. Sequences 
alignment of the deduced protein and hemoglobins 
from five different plant species in the SwissProt da- 
tabase (Fig. 2), shows 46-63% identity and 71-88% 
similarity. 

Total RNA from non-induced leaf tissue and from 
leaf tissue explants taken from embryo cultures was 
subjected to Northern blot analysis using CHI3206 
cDNA as a probe (Fig. 3A, dO to d5+3). Hybridisa- 
tion of CHI3206 probe with a 0.7 kb transcript was 
observed in leaf tissue upon embryo induction, and 
also in leaf tissue after transfer to expression me- 
dium, albeit at a much lower level. The level of 



CHI3206 mRNA increased from day 1 onward to 
a maximum at day 3 and decreased again thereafter 
(Fig. 3A, dl to d5+3). Scanning the blots suggested 
that the level of CHI3206 mRNA at day 3 was about 
35-fold higher when compared to the level at day 1 
(Fig. 3C). At day 4 and day 5 the levels were still 
about 20- and 10-fold higher respectively, whereas 
3 days after transfer to the expression medium 
(d5+3) the level was about equal to that at day 1. 
No hybridisation of CHI3206 was observed with 
RNA isolated from non-induced leaf tissue (Fig. 
3A, dO), nor with RNA from leaf tissue of a non- 
embryogenic cultivar Orchies of C. intybus when cul- 
tured under embryogenesis-inducing or -expression 
conditions (Fig. 3A, dO to d5+3). These results sug- 
gest that in Cichorium '474' the nonsymbiotic hemo- 
globin gene corresponding to CHI3206 is specifically 
expressed under somatic embryogenesis-inducing 
conditions, and that its expression is not related to 
stress caused by wounding or tissue culture condi- 
tions. The high level of transcripts present at day 3 
suggests that the expression of the hemoglobin gene 
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Fig. 3. Northern blot analysis of the CH 13206 hemoglobin gene 
expression during somatic embryogenesis in Cichorium hybrid 
'474'. (A and B_ Autoradiograms of Northern blots hybridised 
with 32 P probes corresponding respectively to CHI3206 (A) and 
a 18S genomic fragment Xbal-EcoRl (1.4 kb) from Cichorium 
intybus (B). Total RNA (10 p.g) was isolated from non-induced 
leaf tissue (dO, lanes 1, 8), leaf tissue subjected for I, 2, 3, 4, or 
5 days to somatic embryogenesis induction medium (lanes 2-6 
and 9-12), and leaf tissue subjected for 5 days to somatic em- 
bryogenesis induction and cultured for 3 days in expression me- 
dium (lanes 7, 13) of the Cichorium hybrid '474' embryogenic 
responsive line (lanes 1-7) or of the Cichorium cv. Orchies non- 
ernbryogenic responsive cultivar (lanes 8-13). Total RNA was 
separated by agarose electrophoresis and transferred to Gene- 
Screen nylon membranes (NEN Biomedical, Boston, MA). The 
probes for CHJ3206 and 18S were 32 P labelled by random 
priming with the T7 QuickPrime Kit (Pharmacia Biotech, Frei- 
burg, Germany). (C) Transcription level of CHI3206 during in- 
duction of somatic embryogenesis in Cichorium hybrid '474'. 
The lanes of the autoradiograms in (A) and (B) were scanned 
using a Sharp JX330 scanner connected to a Sun Sparc 20 mi- 
crocomputer. The TIFF files were analysed using the Whole 
Band Analyzer software from Biolmage (Biolmage Corpora- 
tion, Ann Arbor, MI). Small variations between total mRNA 
samples were corrected by using the ratio of hybridisation sig- 
nals of CHI3206 hemoglobin and 18S rRNA hybrid. 

is correlated with the initiation of somatic embryo- 
genesis rather then with somatic embryogenesis per 
se. The presence of the hemoglobin mRNA after 



transfer of the leaf tissue explants to the expression 
medium may be due to asynchronous initiation of 
somatic embryogenesis in the expression medium [8]. 

Upon the first discovery of nonsymbiotic hemoglo- 
bins [10], Appleby et al. [11] suggested that nonsym- 
biotic hemoglobins may be involved in facilitating 
the diffusion of oxygen and/or the sensing the levels 
of oxygen available in roots. They hypothesised that 
under normal conditions hemoglobin would be oxy- 
genated and that an increased level of deoxyferous 
hemoglobin would trigger a response to hypoxic con- 
ditions. In barley and maize, the expression of a non- 
symbiotic hemoglobin gene was induced under hy- 
poxic conditions [6,10], and in Casuarina glauca too 
the expression seemed restricted to internal tissues, 
where oxygen supply may be limiting [4]. During 
somatic embryogenesis the leaf explant tissue is sub- 
merged in liquid medium, and though the cultures 
are agitated, hypoxic conditions may result, particu- 
lar at the elevated temperature (35°C) conditions. 
These hypoxic conditions then may be responsible 
for inducing the expression of the hemoglobin gene 
in the embryogenic responsive cultivar. However, in 
leaf tissue explants from a non-embryogenic C. inty- 
bus cv. Orchies cultured under similar conditions the 
hemoglobin gene is not expressed (Fig. 3A). The in- 
duction of embryogenesis in Cichorium e 474 5 is ac- 
companied by the reactivation of somatic cells [8], 
a developmental switch that undoubtedly requires a 
complex array of energy-demanding processes, and it 
may be that the hemoglobin is involved in the energy 
generation. The question raised is whether the ex- 
pression of the hemoglobin gene is triggered by the 
local oxygen limiting conditions in the reactivated 
cells or whether the induction of the hemoglobin 
gene expression requires a more specific signal. Pro- 
moters of symbiotic and nonsymbiotic hemoglobin 
genes were shown to direct both root-specific and 
nodule-specific expression of the P-glucuronidase 
gene in transgenic plants, indicating that these pro- 
moters share conserved as-acting DNA motifs in- 
volved in the regulation of gene expressipn [4,12]. 
Initially, symbiotic hemoglobins were thought to be 
typical late nodulins, being present only after forma- 
tion of the nodule [3]. However, it was found recently 
in Vicia sativa (vetch) that the expression of the leg- 
hemoglobin gene VsLBl may actually be one of the 
first responses upon Rhizobium-legume interaction 
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[5]. Within 1 h after the application of Rhizobium 
lipo-chitin oligosaccharides, called Nod factors [13], 
VsLBl mRNA was detectable in the root hairs of the 
susceptible zone. It was suggested that the leghemo- 
globin may facilitate oxygen supply to support mito- 
chondrial respiration in the rapidly respiring root 
hairs [5]. It may be that in Cichorium both somatic 
embryogenesis and hemoglobin gene expression are 
induced in response to the same signal molecule(s), 
similar to the induction of root hair deformation and 
the expression of the leghemoglobin gene by Nod 
factors in the water-logged root hairs in the Rhi- 
zobium-Vicia interaction [5]. In this respect, the pos- 
sible involvement of Nod factor-like signal molecules 
in somatic embryogenesis [14] is particularly interest- 
ing. 

Somatic embryogenesis is the result of dedifferen- 
tiation and reacquisition of a morphogenic compe- 
tence. The transient expression of a nonsymbiotic 
hemoglobin gene during the induction of somatic 
embryogenesis may be related either to the dediffer- 
entiation process or to the reacquisition of embryo- 
genic competence. In a previous study [15], three 18.1 
kDa acidic proteins, p/~5, were found to be tran- 
siently present at a low level in leaf tissue explants 
during the induction phase of Cichorium '414' so- 
matic embryogenesis. The physico-chemical parame- 
ters of these proteins are similar to the putative he- 
moglobin encoded by CHI3206, and we are currently 
in pursuit of their identity, Further analysis of the 
expression of the CHI3206 gene, in particular by in 
situ hybridisation, may facilitate the unravelling of 
the role of hemoglobin during Cichorium somatic 
embryogenesis. 
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Leguminous plants have both symbiotic and nonsym- 
biotic hemoglobin (sym- and nonsym-Hb) genes. Three 
symbiotic (LjLbl, 2, 3) and one nonsymbiotic (LjNSGl) Hb 
genes were isolated from a genomic library of Lotus japoni- 
cus MG20 Miyakojima. Two motif sequences (AAAGAT 
and CTCTT) critical for nodule specific expression were 
conserved on the 5'-upstream sequence of LjLbl, 2 and 5. 
The 5'-upstream region of LjNSGl contained the sequence 
consensus for nonsym-Hb. RT-PCR with specific primer 
sets for each LjLb gene showed that all the sym-Hb genes 
(LjLbl, 2, 3) were expressed specifically and strongly in root 
nodules. The expression of LjLbl, 2 and 3 could not be 
detected in root, leaf or stem of a mature plant, whereas low 
level expression was detected in seedlings by RT-PCR. This 
suggests that sym-Hbs may have another unknown func- 
tion besides being oxygen transporter for the microsymbi- 
ont in symbiotic nitrogen fixation. The expression of 
LjNSGl, examined with RT-PCR, was detected at low level 
in root, leaf and stem. The expression of LjNSGl was 
enhanced in root nodules, whereas it was repressed in roots 
colonized by mycorrhizal fungi Glomus sp. RIO. The 
repression of the nonsym-Hb gene was also observed in the 
roots of Medicago sativa colonized by Glomus sp. RIO. 

Keywords: Expression — Hemoglobin — Lotus japonicus — 
Mycorrhiza — Nodule. 

Abbreviations: AM, arbuscular mycorrhiza; Hb, hemoglobin; Lb, 
leghemoglobin; nonsym-Hb, nonsymbiotic hemoglobin; sym-Hb, sym- 
biotic hemoglobin. 

The nucleotide sequences reported in this paper have been sub- 
mitted to DDBJ under accession numbers AP004624, AP004625, 
AP004626, AP004627, AP004628 and AP004629. 



Introduction 

Hemoglobin (Hb) in legumes, so called leghemoglobin 
(Lb), exists abundantly in root nodules and regulates oxygen 
concentration to create a micro-aerobic environment for nitro- 



genase activity of rhizobia in the symbiotic zone. Hbs have 
also been found in the root nodules of nitrogen-fixing non- 
legumes such as Casuarina glauca, Myrica gale and Alnus glu- 
tinosa. These Hbs, which are nodule specific or associate with 
a symbiotic interaction with microorganisms, are called "sym- 
biotic hemoglobins (sym-Hbs)". Lbs, i.e. sym-Hbs in legumes, 
are encoded by a family of genes and multiple isomers exist in 
nodules. 

Other Hbs showing only a limited amino acid sequence 
similarity to the sym-Hbs have been identified in non-nodulat- 
ing plant Hordeum, Triticum, Arabidopsis, Zea and Oryza 
(Arredondo-Peter et al. 1998). These findings suggest that Hbs 
are contained in all plants, and the Hbs in non-nodule tissues 
are called "nonsymbiotic hemoglobins (nonsym-Hbs)". Non- 
sym-Hbs have also been identified in legumes, Glycine max 
(Andersson et al. 1996) and Medicago sativa (Seregelyes et al. 
2000) and in nodulating non-legumes Casuarina glauca 
(Jacobsen-Lyon et al. 1995). A nonsym-Hb gene in M. sativa , 
Mhbl, is expressed under hypoxia as are other nonsym-Hbs 
(Seregelyes et al. 2000). However, it is not clear whether non- 
sym-Hb has any role in symbiosis with microorganisms. 

In this report, genomic clones of three symbiotic and one 
nonsymbiotic Hb genes were isolated from the genomic library 
of a model legume Lotus japonicus MG20 Miyakojima (Sato et 
al. 2001). To examine the response of these Hb genes to 
microsymbionts, the expression of each sym- and nonsym-Hb 
gene was analysed by RT-PCR. We focused on the expression of 
Hb genes in plants that are in symbiosis with Mesorhizobium loti 
and Glomus sp. The expression of the nonsym-Hb gene of M 
sativa was also examined in plants that are in symbiosis with 
Glomus sp. for comparison with that of L. japonicus. This is 
the first report about the expression pattern of nonsym-Hb gene 
within the plant that is in symbiosis with mycorrhizal fungi. 

Results 

Structure of sym-Hb genes ofL. japonicus 

Two sets of PCR primer pairs, LjLbFl/LjLbRl and 
LjN77FI/LjN77Rl, were used for amplification of the part of 
genomic sym-Hb genes of L. japonicus. Three DNA fragments 
were amplified by PCR using genomic DNA of L. japonicus 
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Fig. 1 Intron and exon structure of sym- and nonsym-Hb genes of L. japonicus. Exons are indicated by black boxes. The numbers indicate the 
size of exons and introns in bp. The small arrows indicate the annealing positions of primers for PCR. 
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ation codon ATG The motifs resembling nodulin motif (AAAGAT and CTCTT) were found in 5'-upstream of the three sym-Hb genes (LjLbJ, 
LjLb2, LjLb3) of L. japonicus. The motif GAGGG was conserved in 5'-upstream of the nonsym-Hb genes of plants. The motifs are indicated by 
boldface in the gray boxes. 



Miyakojima MG20 as a template, suggesting that at least three 
independent sym-Hb genes exist on the genome. Another three 
sets of primer pairs, Lj LbF2/Lj Lb 1 R, Lj LbF2/LjLb2R and 
LjLbF2/LjLb3R, were redesigned according to the sequences 
of the three fragments generated by PCR using primer pairs of 
LjLbFl/LjLbRl and LjN77Fl/LjN77Rl, and were used for 
screening the 3D-pool of the genomic library constructed by 
the TAC vector (Liu et al. 1999, Sato et al. 2001). As a result of 
screening, three independent sym-Hb genes were identified and 
were referred to as LjLbl, LjLb2 and LjLb3 (Fig. 1). The TAC 
clones on which LjLbJ and LjLb2 were located adjoined each 
other. Furthermore, LjLb3 was mapped approx. 3 cM away 
from LjLbl and LjLb2 on the same linkage group (linkage 
group 5, marker names are TM0089 for LjLbl and LjLb2 and 
TM0090 for LjLb3; http://www.kazusa.or.jp/lotus/). The genomic 
sequences of the three sym-Hb genes were decided including a 



5 '-promoter region and 3'-UTR. Although the sequences of the 
four exons and two of the three introns were conserved, the 
third intron was specific to each sym-Hb gene. No significant 
homology could be detected in the 3'-UTR except for the puta- 
tive signal sequence (AATAAA) for poly-A addition. The 
sequences of the 5 '-upstream regions showed high homology 
with the promoter sequence of sym-Hb gene of other legumi- 
nous plants. The nodulin motif (A AG AT and CTCTT) like 
sequences were found in this region (Fig. 2). For LjLbl, the 
spacing ( 1 4 bp) between the motif sequences differs from that 
in other sym-Hb genes LjLb2 (6 bp) and LjLb3 (5 bp). 

Structure of nonsym-Hb gene ofL. japonicus 

Two sets of primer pairs for amplification of nonsym-Hb 
gene, LjNSGFl/LjNSGRl and LjNSGF2/LjNSGR2, were 
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Fig. 3 Southern hybridization probed with the partial fragments of 
the each globin gene. Genomic DNA of L. japonicus was digested by 
Pst\ (lane P), HM\\\ (lane H) and £coRI (lane E), respectively. The 
restriction fragments hybridized with the respective probes are indi- 
cated by closed arrowheads and cross hybridized fragments are indi- 
cated by open arrowheads. 



designed according to the sequence of the nonsym-Hb homo- 
logue appearing in seedling expressed sequence tags (ESTs) of 
L. japonicus (Asamizu et al. 2000). Two DNA fragments were 
amplified by PCR with each primer pair and the sequences of 
the amplified products supported the fact that these DNA frag- 
ments were derived from the same gene on the genome. From 
the 3-D pool of the genomic library, one TAC clone was 
screened using the primer pair LjNSGFl/LjNSGRl for non- 
sym-Hb gene. The Hb gene on this clone showed high homol- 
ogy with known nonsym-Hb genes and was referred to as 
LjNSGl (Fig. 1). LjNSGl was mapped on the linkage group 3 
(marker name is TM0091, http://www.kazusa.or.jp/lotus/). The 
highly conserved sequence (GAAGGG) among the nonsym-Hb 
genes existed on the 5'-upstream region of LjNSGl and one of 
the nodulin motif sequences (CTCTT) was also found (Fig. 2). 
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Fig. 4 Phylogenetic tree based on the amino acid sequences of plant hemoglobins. The tree was constructed by neighbour-joining method. The 
numbers on the branches show bootstrap probabilities (as percentages) determined from 1,000 resamplings. The database accession numbers are 
indicated in parentheses after the plant names. 
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Fig. 5 Expression of sym-Hb and nonsym-Hb genes in different tis- 
sues of a mature plant of L. japonicus. Total RNAs were isolated from 
leaf (lane L), stem (lane S), root (lane R), and nodule (lane N). RT- 
PCR was performed using specific primers for each sym- and nonsym- 
Hb genes. The PCR product derived from transcript of LjLbl is indi- 
cated by an arrowhead. The transcripts of LjLbs were detected only in 
nodules, whereas the transcript of LjNSGI was detected in all the tis- 
sues tested. The eIF-4A of L. japonicus was used as a positive control. 
The number of amplification cycles was 20. Lane M, size marker. 



Southern hybridization probed with sym- and nonsym-Hb genes 
of L. japonicus 

Southern hybridization probed with sym- and nonsym-Hb 
genes was performed against genomic DNA of L. japonicus 
(Fig. 3). Total genomic DNA was digested with restriction 
endonucleases Pst\, ///mil 1 1 and EcoRl, and separated on 0.7% 
agarose gel. After blotting onto the nylon filter, hybridizations 
were carried out using 32 P-labeled probes prepared by PCR 
using the corresponding TAC clone as a template. The hybridi- 
zation profiles obtained from LjLbl- and Z/Z,62-probes were 
identical each other, because LjLbl and LjLbl were located on 
the same Pst\ fragment, and cross hybridization occurred for 
HindWl and EcoRl fragments (Fig. 3, LjLbl, LjLbl). Hybridi- 
zation profiles obtained from LjLbl- and LjNSGI -probe were 
specific to each probe (Fig. 3, LjLbl, LjNSGI). All the hybrid- 
ized fragments detected were consistent with the restriction 
maps of each TAC clone carrying sym- and nonsym-Hb genes. 

Phytogeny of Sym- and Nonsym-Hbs of L. japonicus with Hbs 
in other plants 

A phylogenetic tree was constructed comparing the amino 
acid sequences with a known plant hemoglobin, using the 
neighbour-joining method in the ClustalW program (Fig. 4). 
The predicted amino acid sequences of the three sym-Hbs 
(LjLbl, LjLb2 and LjLb3) of L Japonicus form a cluster within 
a large cluster of sym-Hbs and with VfLb29 of the gene 
expression induced by the infection of the mycorrhizal fungi 
(Fruhling et al. 1997). The predicted amino acid sequence of 
LjNSGI, nonsym-Hb of L. japonicus, belonged to a cluster 
with other nonsym-Hbs of leguminous plants. 
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Fig. 6 Expression of sym-Hb genes in young shoots and nodules of 
L, japonicus. (A) Total RNA was prepared from young shoots of three 
days after germination. RT-PCR was performed using specific primers 
for each Hb gene. The PCR product derived from the transcript of 
LjLbl is indicated by an arrowhead. Lane M, size marker. (B) Total 
RNA was prepared from young shoots (Y) or mature nodules (N). The 
degenerate primer pair for the three LjLbs was used for amplification. 
The elF-4A was used as a positive control. 



Expression of sym- and nonsym-Hb genes in nodules and in 
other tissues 

The expression of sym- and nonsym-Hb genes in nodules 
and in other tissues of L. japonicus was analysed by RT-PCR. 
For detection of the transcripts of LjLbl, LjLbl, LjLbl and 
LjNSGI, primer pairs of LjLbF2/LjLblR, LjLbF2/LjLb2R, 
LjLbF2/LjLb3R and LjNSGFl/LjNSGRl were used, respec- 
tively. The primer pair of LjLbF2/LjLb3R generated two 
cDNA fragments; one was a derivative from a transcript of 
LjLbl and the other was from LjLbl (Fig. 5, LjLbl lane N; Fig. 
6A, lane LjLbl). Although the primer pair of LjLbF2/LjLb3R 
is not specific to LjLbl transcript, the pair is useful for detec- 
tion of the LjLbl transcript. RT-PCR using primer pairs for 
sym-Hb genes (LjLbl, 1 and I) showed that all. the sym-Hb 
genes were expressed specifically and strongly in root nodules 
(Fig. 5). The significant difference on gene expression did not 
exist in the case of the three sym-Hb genes of L. japonicus. The 
expression of the sym-Hb genes could not be detected in root, 
leaf and stem of a mature plant even though the number of 
amplification cycles was increased to 35. However, the low 
level expression was detected in seedlings 3 d after germina- 
tion (Fig. 6). The transcripts from all the sym-Hb genes were 
detected (Fig. 6A) and the expression level in young shoots 
was estimated to be very low compared with that in nodules 
(Fig. 6B). The expression of L/N16, one of the representative 
late nodulin genes (Kapranov et al. 1997), could not be 
detected, indicating that there was no RNA contamination from 
nodules into the RNA prepared from young seedlings (data not 
shown). This suggests that sym-Hb has unknown function in 
young tissues other than oxygen transporter for the microsym- 
biont in nodules. 
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Fig. 7 Expression of sym-Hb and nonsym-Hb genes in nodules and 
mycorrhiza. Total RNAs were isolated from nodules (lane N), roots 
colonized by Glomus sp R-IO (lane +) and roots not infected with M. 
loti and Glomus (lane - ). The transcripts of LjLbs were detected only 
in nodules. The amount of the transcript of LjNSGl seems to be low in 
mycorrhiza (LjNSGl, lane +) compared with that in the root without 
infection (LjNSGl, lane — ). Ljlbl-3 indicates the amplified product 
using the degenerate primer pair for the three LjLbs. The elF-4A of L. 
japonicus was used as a positive control. 

The expression of nonsym-Hb gene (LjNSGl) was 
detected at low level in root, leaf and stem by RT-PCR (Fig. 5). 
LjNSGl was also expressed in nodules and the level of expres- 
sion was enhanced compared with that in the other tissues (Fig. 
5). The amount of LjNSGl transcripts in total RNA was esti- 
mated by quantitative RT-PCR and compared between nodules 
and other tissues. The amount of transcripts of LjeIF-4A was 
used for normalization. As the result of quantitative real time 
RT-PCR, the expression level of LjNSGl in root nodules was 
almost 20 times higher than that in roots (Fig. 8). 

Expression of sym- and nonsym-Hb genes in mycorrhiza 

The expression of sym-Hb genes LjLbl, 2 and 3 could not 
be detected in the roots colonized by mycorrhizal fungi Glomus 
sp. RIO (Fig. 7). Any amplified DNA fragments could not be 
detected even though the number of amplification cycles in 
PCR was increased up to 35, This indicates that the infection of 
mycorrhizal fungi does not induce the expression of sym-Hb 
genes in L. japonicus. 

The expression of nonsym-Hb gene LjNSGl was 
repressed in the roots colonized by mycorrhizal fungi Glomus 
sp. R-10 (Fig. 7). The expression level of LjNSGl in the roots 
detached nodules was estimated to be almost the same as that 
in the roots without any infection (data not shown). Thus, the 
repression of LjNSGl is expected to be a response to the infec- 
tion of Glomus sp. The amount of transcripts of LjNSGl was 
compared between RNA from roots colonized by Glomus sp. 
and from roots without any infection by quantitative real time 
RT-PCR (Fig. 8). The amount of transcripts of LjelF-4A was 
used for normalization. The expression level of LjNSGl in the 
roots colonized by Glomus sp. was 10 times less than that in 
the roots without colonization. 




Fig. 8 Relative amounts of the transcript of LjNSGl. The amounts of 
the transcript in different tissues were estimated by quantitative real 
time RT-PCR and indicated as the amount in the normal root was 1 .0. 
The amount of the transcript of LjeIF-4A was used for normalization. 
The values indicate the average of three independent experiments. The 
expression of LjNSGl in root nodules (N) was enhanced and that in 
mycorrhiza (AM) was repressed compared with normal roots (R). L, 
leaf; S, stem; R, normal root (root without Glomus sp. R-10); N, nod- 
ules; AM, root colonized by Glomus sp. R-10. 
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Fig. 9 Expression of nonsym-Hb gene in mycorrhiza of Medicago 
sativa. Total RNAs were isolated from roots with (AM+) or without 
(AM-) colonization of Glomus sp. R-10. RT-PCR was performed 
using specific primers for Mhbl, nonsym-Hb gene of M. sativa. Ribos- 
omal RNA of M. sativa was used as a positive control. 

To confirm whether repression on nonsym-Hb gene 
expression is specific to L. japonicus or not, the expression of 
Mhbl, nonsym-Hb gene of Medicago sativa, was analyzed by 
RT-PCR (Fig. 9). The expression of Mhbl was also repressed 
in the mycorrhiza of Medicago sativa. The expression level of 
Mhbl in mycorrhiza was estimated to be eight times less than 
that in the normal roots by quantitative RT-PCR (data not 
shown). These results suggest that mycorrhizal fungi control 
the expression of the nonsym-Hb gene during the establishment 
of symbiosis with leguminous plants. 

Discussion 

Lbs, sym-Hbs in legumes, are the most abundant soluble 
proteins in root nodules and their genes belong to multigene 
families (Brisson and Verma 1982, Appleby 1992). At least 
three sym-Hb genes, LjLbl, LjLb2 and LjLbl, were identified 
in the genome of L. japonicus. The sym-Hb promoters contain 
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two motifs that have been shown to be critical for nodule spe- 
cific expression (the nodulin motif, AAAGAT and CTCTT sep- 
arated by six or seven nucleotides). The nodulin motifs of the 
three LjLb genes are slightly different each other (Fig. 2), 
whereas no significant differences were detected by RT-PCR in 
the quantity of mRNA in each gene in the root nodules (Fig. 5). 
Pisum sativum has at least five genes for sym-Hbs. The quan- 
tity of mRNA . in each of the five genes was different in the 
nodules (Kawashima et al. 2001). Expression of the PsLb5-W 
gene was the highest, and that of PsLb 120-8 was the lowest. 
The nodule morphology of L. japonicus is the determinate type 
and that of P sativum is the indeterminate type. The difference 
in the relative amounts of sym-Hb transcripts in the nodules 
will be due to the difference of nodule type between P sativum 
and L. japonicus 

Although sym-Hb has been thought to be a representative 
late nodulin, Heidstra et al. (1997) reported that the expression 
of sym-Hb gene of Vicia sativa, VsLbl, was induced within 1 h 
by a Nod-factor addition, unlike other late nodulin genes. In 
mature plants of L. japonicus, the expression of the three sym- 
Hb genes could be detected only in the root nodules the as 
same as sym-Hb genes in other legumes. Surprisingly, the tran- 
scripts of the three sym-Hb genes were detected in young 
shoots (3 d after germination) independent of Nod factor addi- 
tion or M loti infection, even though the expression was very 
low compared with that in nodules (Fig. 6). This result suggests 
that sym-Hb has another unknown function other than being 
the oxygen transporter for microsymbionts. 

VfLb29 J one of the sym-Hb genes of Vicia faba, is known 
to be induced both in root nodules and in roots colonized by the 
arbuscular mycorrhiza (AM) fungus Glomus fasciculatum, and 
other sym-Hb genes (LfLbB, VjLbK, VjLb49) were induced 
only in root nodules (Fruhling et al. 1997). The possible func- 
tion of VfLb29 was to supply oxygen for the respiration of the 
microsymbiont. The three sym-Hb genes in L. japonicus were 
not induced in the roots colonized by Glomus sp. R-10 (Fig. 7). 
The well-developed hyphae on the surface of the roots might 
transport sufficient oxygen to internal hyphae so that Glomus 
sp. R-10 does not need the help of sym-Hb in the host plants, 
and/or the control of expression might be quite different among 
VfLb29 and sym-Hb genes in L. japonicus. 

A gene that encodes nonsym-Hb was isolated from 
genomic library of L. japonicus and was referred to as LjNSGL 
In C. glauca that establishes symbiotic nitrogen-fixing asso- 
ciations with the actinomycete Frankia, expression of the 
nonsym-Hb gene (cashb-nonsym) was not detectable in nod- 
ules (Jacobsen-Lyon et al. 1995). On the other hand, the non- 
sym-Hb gene (Soyhb) of G. max was expressed in root nodules 
and the expression level was slightly higher compared with that 
in shoots, leaves, flower buds and root (Andersson et al. 1996). 
LjNSGl was expressed in all of the tissues tested and the 
expression level was estimated to be 20-fold higher in nodules 
compared with that in roots by quantitative RT-PCR (Fig. 5, 8). 
The sequence of 5 '-upstream region of LjNSGl contains the 



nodulin motif-like sequences (CACGAT-6 bases spacer- 
CTCTT), which resemble the nodulin motif sequences (CAA- 
GAT-6 bases spacer-CTCTT) of sym-Hb genes of C. glauca. 
The promoter analysis of nonsym-Hb of Trema tomentosa, a 
kind of elm tree and non-nodulating plant, was reported by 
Andersson et al. (1997). The nodulin motif-like sequences 
(AAGGAG-4 bases spacer-CTCTC) and consensus sequence 
of nonsym-Hb (GAAGAG) were also found in the promoter of 
the nonsym-Hb gene of T. tomentosa (Fig. 2), and it was pre- 
sumed that both AAGGAG and GAAGAG sequences were 
critical for the nonsym-Hb expression in nodules and roots of 
transgenic Lotus corniculatus . Both the consensus sequence of 
nonsym-Hb like sequence (GAAGGG) and nodulin motif like 
sequences (CACGAT-4 bases spacer-CTCTT) were found on 
the promoter region of LjNSGL These sequences might be crit- 
ical for the enhanced expression of LjNSGl in root nodules of 
L. japonicus. 

There are several reports about expression and induction 
of nonsym-Hb gene in plants. The nonsym-Hb gene of soy- 
bean was expressed in many tissues but at different levels 
(Andersson et al. 1996). Accumulation of mRNA from the 
nonsym-Hb gene or the nonsym-Hb protein was detected under 
hypoxia in barley (Taylor et al. 1994) and in rice plants (Lira- 
Ruan et al. 2001). The nonsym-Hb gene in barley is shown to 
be involved in ATP metabolism under hypoxia (Sowa et al. 
1998). The expression profile of the two nonsym-Hb genes of 
A. thaliana, AHB1 and AHB2, is also well documented. The 
expression of AHB1 was induced by hypoxia in roots and 
rosettes, whereas that of AHB2 was induced by low tempera- 
ture (Trevaskis et al. 1997). Wang et al. (2000) revealed that 
AHB1 also responds to the level of nitrate using the micro- 
array of A. thaliana. Mhbl, the nonsym-Hb gene of M. sativa, 
was also inducible by hypoxia but not by low temperature. The 
level of the transcript of Mhbl was increased at the G2/M 
boundary in a synchronized suspension culture (SeregeMyes et 
al. 2000). 

In this report, we focused on the response of Hb genes to 
microsymbionts and showed that the expression of the nonsym- 
Hb genes (LjNSGl and Mhbl) was repressed in mycorrhiza 
both in L. japonicus and M. sativa (Fig. 7, 8, 9). We do not 
have a reliable explanation for this observation, because 
nonsym-Hb is expected to have multiple roles in different plant 
tissues and to be involved in several metabolic pathways as 
described by Arredondo-Peter et al. (1998). Nonsym-Hb will 
be able to bind to small molecules known to be ligands of 
hemoglobins, e.g. 0 2 , CO and NO. NO is now known to be a 
signal for activation of the plant defense system (Durner et al. 
1998, Klessig et al. 2000). Many investigators have reported 
that the AM-symbiosis reduces root disease caused by several 
soil-born pathogens (Davis and Menge 1980, Bartschi et al. 
1981, Mark and Cassells 1996, Murphy et al. 2000, Norman 
and Hooker 2000). Recently, Pozo et al. (2002) reported that 
Glomus mosseae was effective in reducing disease symptoms 
caused by Phytophthora parasitica infection on tomato roots 
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and that new isoforms of defense-related proteins (chitinase, 
chitosanase and p-l,3-glucanase) were induced in mycorrhiza. 
Although it is not clear whether nonsym-Hb is involved in the 
plant defense system directly, repression of the nonsym-Hb 
gene might facilitate the activation of plant defense system if 
nonsym-Hb plays a role as a NO scavenger in a normal plant 
cell. Further investigation on biochemical characterization will 
be required to understand the function of plant hemoglobins on 
symbiosis and the defense system. 

Materials and Methods 

Plants and microorganisms 

Lotus japonicus MG20 (Kawaguchi 2000) and Medicago sativa 
(Yukijirushi Shubyo Co. Ltd.) were used as the host plants. Mesorhizo- 
bium loti MAFF303099 (Saeki and Kouchi 2000, Kaneko et al. 2000) 
and Glomus sp. R-10 (Dr. Kinkon, ldemitsu Kosan Co, Ltd., Japan) 
were used as the symbiotic partner. 

Nodulation and mycorrhization 

For nodulation, the seedlings of L. japonicus were inoculated 
with M. loti 3 d after germination The inoculated seedlings were trans- 
ferred onto agar plates and were grown under a regime of 16 h light/ 
8 h dark at 25°C. The nodules for RNA isolation were harvested at 6 
weeks after inoculation. For mycorrhization, the inoculum and growth 
conditions were as described previously (Solaiman et al. 2000). The 
mycorrhizal colonization was confirmed by microscopy at 6 weeks 
after cultivation, and the whole root was used for RNA isolation. 

Isolation of total DNA and RNA from plants 

Genomic DNA of L. japonicus was isolated by extraction with 
CTAB (hexadecyltrimethyl ammonium bromide) and total RNA was 
isolated using a phenol-SDS procedure according to the method of 
Suzuki et aK (1997). 

PCR for amplification and identification of genomic hemoglobin clones 
ofL. japonicus 

The primer pairs for amplification of sym-Hb genes were 
designed based on the sequences of leghemoglobin homologues 
(AF000390, AF000405, AF000406, AV418770) found in the EST 
libraries of root nodules of L. japonicus reported by Szczyglowski et 
al. (1997) and Asamizu et al. (2000): LjLbFl, GGCATATGAAACAT- 
TCAAGAA; LjLbRl, AACCATTGTGACTATATATTTCQ LjN77Fl, 
GCAATAGTGTTCTGTTCTACACC; LjN77Rl, GCGAAACCAGA- 
AACATC. For screening of the 3-D pool of the genomic library of L. 
japonicus, other primer sets were used. The sequences of the primers 
were as follows: LjLbF2, AAAGACATGTTCTCCTTTCT; LjLblR, 
TGGAATACATATGGTTTATAG; LjLb2R, GCTATAAATTTCAACT- 
TCAG; LjLb3R, AACAAGTTTTATTGGAGTAC. The primer pairs 
for the nonsym-Hb gene were designed based on the sequence of a 
nonsym-Hb homologue (AV4 13959) found in ESTs of seedlings of L. 
japonicus constructed by Asamizu et al. (2000): LjNSGFl, TTCT- 
CACTTCACTTCCATCGC; LJNSGF2, TTGGTCAAGTCATGGAGCG; 
LjNSGRl, TCACAGTGACTTTTCCAGCG; LjNSGR2, AGACAGA- 
CATGGCATGAGGC. These primers for nonsym-Hb gene were also 
used for screening of the 3-D pool of the genomic library, PCR was 
performed in 30 cycles (94°C for 30 s, 55°C for 30 s, 72°C for 30 s). 
The PCR products were analyzed on 0.7% or 1.0% agarose gel 
depending on desired resolution. 



Southern hybridization 

The genomic DNA of L. japonicus was purified by CsCl-EtBr 
dye buoyant density gradient centrifugation and was digested with 
restriction endonucleases, Pst\, //mdlll, BamHl and £coRI. The 
digests (approx. 4 u.g) were separated on a 0.7% agarose gel, and then 
transferred to a nylon filter (Immobilon-Ny+; Millipore). Hybridiza- 
tion was carried out by the same conditions described in Suzuki et al. 
(2001). DNA probes for sym- and nonsym-Hb genes were obtained by 
PCR using corresponding TAC clone as a template. The sequences of 
primers used for amplification for each gene were as follows: AAA- 
GACATGTTCTCCTTTCT and TGGAATACATATGGTTATAG for 
LjLbl; TAGAAAGATCTGTTACCGGT and GCCATTATGGCTATA- 
AATTTCA for LjLb2; CAATGTTCCTATTGTCCACT and CTGC- 
CAAACAAAGAATAGCA for LjLb3; TTCTCACTTCACTTCCAT- 
CGC and TCACAGTGACTTTTCCAGCC for LjNSG 7. By using these 
primer sets, DNA fragments ranging from the second exon to 3'-UTR 
of LjLbl, ranging from the third intron to 3'-UTR of LjLbl, ranging 
from the first exon to the third exon of LjNSG], and a part of the third 
intron of LjLb3 were obtained, respectively. The PCR products were 
labeled with 32 P using Multiprime DNA labeling system (Amersham) 
and used as probes, 

DNA sequencing and sequence analysis 

Sequencing reactions were carried out according to the manufac- 
ture's instruction using the BigDye Terminator Cycle Sequencing FS 
Reaction Kit (Applied Biosy stems) and the sequences were read auto- 
matically using ABI PRISM 310 and 377, Applied Biosystems). Anal- 
yses of sequences and multiple alignment of putative amino acid 
sequences were performed by the Gene Works (Intel ligenetics) and 
Mac Vector programs (Genetics Computer Group). 

Analysis of expression of Hb genes by RT-PCR 

The expression of each Hb gene was detected and estimated by 
RT-PCR (OneStep RT-PCR; Qiagen) using specific primer pairs as fol- 
lows: LjLbF2 and LjLblR for LjLbl; LjLbF2 and LjLb2R for LjLb2; 
LjLbF2 and LjLb3R for LjLb3; LjNSGFl and LjNSGR2 for LjNSGl. 
For Fig. 6, first strand cDNA was synthesized by SuperScriptll (Invit- 
rogen) and ExTaq (TaKaRa) was used for amplification. The degener- 
ate primer pair for the three LjLbs was also used in Figs 6 and 7: 
PEALLLb-F, TCTGGRCCYAMGCAYACTC; PEALLLb-R CRTCR- 
CGWGTCAGTSCAAAA. For detection of the gene expression of 
Mhbl (nonsym-Hb gene of Medicago sativa, AF172172), the follow- 
ing primers were used: MhblF, CATGACATGTGAATCAGCCG; 
MhblR, CCAACTGATCATAAGCTTCTC. The genes for initiation 
factor 4 A3 of L. japonicus, LjelF-4A and 18S rRNA of M. sativa were 
used for positive control. The primers for detection of 1 8S rRNA of M. 
sativa were CCTAGTAAGCGCGAGTCATC and CATTCAATCGG- 
TAGGAGCGA. The amplified products were separated on 1% agar- 
ose gel electrophoresis. 

To quantify the relative amount of transcripts derived from each 
Hb gene, real time RT-PCR was employed. DNase I -treated total RNA 
(100 ng) was used as a template in a 50 ul of reaction mixture contain- 
ing 25 ul of SYBR Green PCR master mix (Applied Biosystems), 1 u.1 
of RNase inhibitor (40 U ul' 1 ), 0.25 ul of Superscript II-RT (Invitro- 
gen, 200 U fif 1 ), and 2 uJ of each primer (5 nM). RT reaction and 
PCR were performed using 7700 Sequence Detection System (Applied 
Biosystems). PCR was carried out 40 cycles of 95°C for 15 s and 60°C 
for 1 min. The sequences of primers for real time RT-PCR were as fol- 
lows: CCTTTGGAGGAGAACCCCAA and GAGCTGCTGATTCA- 
CAAGTCATG for LjNSGl; AGAGGGTTTAAAGATCAAAT and 
ATGTCAATTCATCACGTTTT for LjelF-4A. These sequences were 
designed with Primer Express program (Applied Biosystems). The 
primer pairs suitable for independent detection of the three LjLbs could 
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not be designed by this program. The gene for initiation factor 4 A3 of 
L. japonicusy LjeIF-4A, was used as an internal standard for RT-PCR 
and was used for normalization of the results of quantitative RT-PCR. 
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Abstract 



Hemoglobins (Hbs) are heme proteins encountered in all five kingdoms of living organisms. In plants, two different classes of Hbs have 
been identified: nonsymbiotic (class I) from both monocot and dicot species and symbiotic (class II) Hbs from nitrogen-fixing plants. This 
work reports the cloning and analysis of three nonsymbiotic Hb genes from wheat {Triticum aestivum) and potato (Solarium tuberosum). The 
Hb cDNAs were amplified by reverse transcriptase polymerase chain reaction (RT-PCR) using consensus oligonucleotide primers for 
nonsymbiotic Hbs. 

A wheat Hb cDNA (TaHbl) was isolated and shows a very high similarity to nonsymbiotic Hbs from Hordeum vulgare (98%) and Zea 
mays (83%). Another wheat Hb cDNA, designated TaHb2, exhibited strong similarity to truncated bacterial Hbs, the so-called 2-on-2 Hbs. In 
addition, a third Hb was cloned from potato, StHb. Expression analysis by RT-PCR demonstrated a very high expression level of the TaHbl 
gene only in wheat roots. In contrast, the other wheat hemoglobin gene, TaHb2, was demonstrated to be constitutively expressed although 
differences in expression level in different tissues were observed. The expression of the TaHbl gene is induced in wheat roots exposed to 
microaerobic conditions. The potato Hb gene, StHb, was highly expressed in roots and also in tubers and stem tissue although at much 
reduced levels. 

© 2003 Elsevier Science B.V. All rights reserved. 
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Hemoglobins (Hbs) are heme proteins encountered in all 
five kingdoms of living organisms whose primary function 
is the binding and transporting oxygen and other gaseous 
ligands [1,2]. Hbs function by means of reversible binding 
in the storage and transportation of oxygen and by facilitat- 
ing its diffusion. In plants, three different types of Hbs have 
been described: symbiotic, nonsymbiotic and truncated Hbs. 
The symbiotic Hbs are found in leguminous plants and in 
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actinorhizal plants existing in symbiosis with N 2 -fixing 
organisms [3]. However, symbiotic-like Hbs are also found 
in plants as Cichorium and display a close sequence 
similarity to symbiotic Hbs [4]. The symbiotic Hbs function 
as oxygen buffers and carriers and provide O2 to the 
respirating micro-symbionts actively fixing N 2 in tissues. 
Symbiotic Hbs also prevent inhibition of the nitrogenase 
complex by binding O2 tightly and facilitating its diffusion 
to N 2 -fixing bacteroids. 

Nonsymbiotic Hbs are ubiquitous in plant species and 
have been divided into two classes: Class II with oxygen- 
binding properties like symbiotic Hbs and class I with 
significantly different oxygen-binding properties. The first 
nonsymbiotic Hb was identified in barley and the identity 
established from its similarity to other nonlegume Hbs [5]. 
Since then nonsymbiotic Hbs have been found in virtually 
all plant species examined including liverworts (Marchantia 
polymorpha) [6], mosses (Physcomitrella patens) [7], the 
monocots rice, maize and teosinte [8-10], and the dicots 
soybean, Arabidopsis [11,12]. The nonsymbiotic Hbs dis- 
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play expression patterns different from the symbiotic Hbs 
and are generally expressed at low levels and found in a 
variety of tissues including roots, stems and leaves of both 
monocots and dicots [13]. The expression patterns of non- 
symbiotic Hb genes varies significantly among different 
species, and often the highest levels of expression are 
detected in metabolically active or stressed organs/tissues. 
A number of nonsymbiotic Hbs belonging to class I have 
been shown to be hypoxia-inducible in plants such as barley 
[14], maize [15], rice [16] and alfalfa [17,18], and are often 
referred to as stress-induced Hb. 

The function of nonsymbiotic Hb in plants has recently 
been reviewed by Ross [19] and Dordas et al. [20]. Non- 
symbiotic Hbs might be oxygen-transporter proteins that 
scavenge oxygen during hypoxia and supply it for aerobic 
respiration, or Hbs can act as terminal NADH oxidases that 
facilitate glycolytic formation of ATP under hypoxic con- 
ditions. A third possible function for Hbs could be as 
oxygen-sensing proteins that activate other regulatory pro- 
teins. It seems most plausible that nonsymbiotic Hbs are 
involved in maintaining the energy status of plant cells 
exposed to low oxygen concentrations. Dordas et al. [20] 
propose a function of hypoxia stress-induced Hb in modu- 
lation of nitric oxide in the plant cell. Recently the first 
crystal structure of a nonsymbiotic rice Hb (Hbl) was 
determined [21]. However, the functional relevance of the 
quaternary structure is still unclear. 

Recently a third group of Hbs, originally identified in 
prokaryotes, protozoa and algae, has also been demonstrated 
to be present in plants. A so-called truncated Hb was 
isolated from Arabidopsis thaliana [22]. The Arabidopsis 
GLB3 protein has a three-dimensional structure similar to 
the characteristic 2-on-2 arrangement a-helices, distinct 
from the 3-on-3 arrangement of the standard globin fold 
[23]. Expressed sequence tags with high similarity to A. 
thaliana GLB3 have been identified in both dicots and 
monocots including barley and maize. GLB3 is expressed 
throughout the plant and does not respond to hypoxia or 
cold stress and, hence, GLB3 expression is regulated differ- 
ently from the two other Arabidopsis Hbs. 

In the present study we have undertaken the cloning and 
analysis of wheat and potato Hb genes. The spatial expres- 
sion pattern of the two wheat Hb genes was investigated in 
response to limiting oxygen stress. 

Tissues of wheat (Triticum aestivum var. Complet) used 
in the reported experiments were collected from field-grown 
plants (Danish Institute of Agricultural Sciences, Tjele, 
Denmark) or grown in a hydroponic culture. Potato (Sol- 
lanum tuberosum cv 68-DCU-28, a tetraploid) plants grown 
under green-house conditions were also used as plant source 
for this experiment. Plant material used for reverse tran- 
scriptase polymerase chain reaction (RT-PCR) was imme- 
diately frozen in liquid nitrogen after harvest and stored at 
- 80 °C. Nucleic acids were extracted and purified from 
deep-frozen plant material. Total RNA was extracted by 
employing the RNeasy Plant Mini Kit from Qiagen accord- 



ing to the manufacturer's instructions. RNA concentrations 
were determined by UV spectrometry. 

Total RNA was isolated from wheat seedlings and potato 
stems as described above. After treatment with RNase-free 
DNase, 5 ug of total RNA was used in an RT-PCR one-tube 
reaction employing a RobusT RT-PCR Kit (Finnzymes). 
Oligonucleotide primers were derived from putative Hb 
sequences in the EST databases found in TIGR (http:// 
www.tigr.org/tdb/). The RT-PCR reaction mix contained: 5 
Ug total RNA, 1.5 mM MgCl 2 , 0.2 mM dNTP, 0.5 uM of 
each primer (TaHbl-F: 5'-GGGAGGGAGGAAGCCATGT- 
CTGCCGCGGAG-3' and TaHbl-R: 5'-ACAATGGATGA- 
GCTATTCAGAGGGCTTCAT-3'~f67cloning oftrie~wheat 
Hbl gene, TaHb2-F: 5'-GGCGCTGCAGGAACGATG- 
CAGTCGCTGCAGG-3' and TaHb2-R: 5'-CCGG- 
TTAAGTTTGAGCGGTTACTCGGCTGG-3' for cloning 
of the wheat Hb2 gene and StHb-F: 5'-AAAGTTTGATCA- 
CAATCATCATGAGTAGCTTT-3' and StHb-R: 5'- 
GAGCCGATAGTCCATCGGAATCAGTCC-3' for cloning 
of the potato Hb gene), 20 units of DyNAzyme EXT DN A 
polymerase (Finnzymes), 5 units of AMV reverse tran- 
scriptase (Finnzymes) and 20 units of RNase inhibitor 
(Roche) in a total volume of 50 ul. Synthesis of cDNA, 
inactivation of AMV reverse transcriptase and denaturation 
of cDNA/mRNA hybrids were performed by one cycle of: 
50 °C for 60 min followed by 94 °C for 2 min. PCR 
amplification was carried out in the total volume by the use 
of the following program: 30 cycles of 45 °C for 30 s 
followed by 72 °C for 1 min 30 s. The PCR program was 
concluded by an extension at 72 °C for 7 min. Ten-micro- 
liter PCR amplification products were applied on a 1% 
agarose gel and visualized after electrophoresis by ethidium 
bromide staining. Fluorescent bands of sizes of approxi- 
mately 500 bp for TaHbl, 550 bp for TaHbl and 650 bp for 
StHb were cut out and eluted using the Qiaquick Gel 
Extraction kit from Qiagen. The eluted PCR products were 
cloned into the pCR TOPO 2.1 vector (Invitrogen) and 
sequenced in both directions employing the dideoxy chain 
termination method using BigDye terminator cycle sequenc- 
ing kit with AmpliTaq DNA polymerase FS (PE Applied 
Biosystems). The sequencing analysis was performed on an 
automated DNA sequencer (ABI PRISM™ Genetic Ana- 
lyzer Model 373/377) (PE Applied Biosystems) using 
facilities within the Institute of Molecular and Structural 
Biology, University of Aarhus, Denmark. All DNA frag- 
ments were sequenced in both directions. 

The identities of TaHbl, TaHb2 and StHb were demon- 
strated by comparison of the deduced polypeptide sequences 
to other isolated plant Hbs. The wheat hemoglobin 1 (Hbl) 
cDNA, TaHbl, consists of a 516-bp fragment with the 
translational start site of the major open reading frame 
(ORF) at nucleotide 16 and the TAG stop site at nucleotide 
504. The deduced amino acid sequence of TaHbl is shown 
in Fig. 1. Comparison of the length of the wheat Hbl with 
other Hb genes indicates that TaHbl is not full-length and 
that both 5'UTR and 3'UTR are incomplete. The transla- 
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PotatoHb. (d) 
TomatoHb. (d) 
CasuarHb2 .(d) 
SoybeanHb. (d) 
CottonHb. (d) 
TremaHb. (d) 
ParaspHb. (d) 
AraHbl. (d) 
WheatHbl . (m) 
BarleyHb. (m) 
MaizeHb. (m) 
RiceHbl.- (m) 
RiceHb2. (m) 



MSS FSEEQEALWKSWGSMKKDAGEWGLKFFLKIFEIAPSAKKMFSF 4 7 

MSS FSEEQEALWKSWGSMKKDAGEWGLKFFLKIFEIAPSAKKMFSF 4 7 

MSTLEGR G- FTEEQEALWKSWSAMKPNAGELGLKFFLKI FEI APSAQKLFS F 52 

MTTTLER G - FSEEQEALWKSWNVMKKNSGELGLKFFLKI FEI APSAQKLFS F 5 2 

- - -MATYEGK V- FTEEQEALWKSWTVMKKKTAELGLKFFLKI FE IAPSAKKLFSF 52 

- - -MSSSEVD KVFTEEQEALWKSWAVMKKNSAELGLKFFLKIFEIAPSANNLFSY 53 

MSSSEVN KVFTEEQEALWKAWAVMKKNSAELGLQFFLKIFEIAPSAKNLFSY 53 

MES-EGK IVFTEEQEALWKSWSVMKKNSAELGLKLFIKIFEIAPTTKKMFSF 52 

MSAAERA- WFSEEKDALVLKSWAIMKKDSANLGLRFFLKI FEIAPSARQMFPF 53 

- - -MSAAEGA WFSEEKEALVLKSWAIMKKDSANLGLRFFLKI FEIAPSARQMFPF 53 

MALAEADDGA WFGEEQEALVLKSWAVMKKDAANLGLRFFLKVFEIAPSAEQMFSF 5 6 

MALVEDNNAV - - -AVSFSEEQEALVLKSWAILKKDSANIALRFFLKIFEVAPSASQMFSF 57 
MALVEGNNGVSGGAVSFSEEQEALVLKSWAIMKKDSANIGLRFFLKIFEVAPSASQMFSF 60 

* **..***.*.* .+ ... .#..*.*.**.**.. 



PotatoHb. (d) LKDSNVPLDQNPKLKVHAKSILVMTCEAAVQLRKAGKVVVRDSTLKKIGATHFKYGVADE 107 

TomatoHb. (d) L KDSNVP LDQN PKL KI HAKS VL VMTCEAAVQLRKAGKVVVRDS TLKKI GATHFKYG VVDE 107 

CasuarHb2. (d) LKDSNVPLERNPKLKSHAMSVFLMTCESAVQLRKAGKVTWESSLKKLGASHFKHGVADE 112 

SoybeanHb. (d) LRDSTVPLEQNPKLKPHAVSVFVMTCDSAVQLRKAGKVTVRESNLKKLGATHFRTGVANE 112 

CottonHb. (d) LRDSNVPLEQNTKLKPHAMSVFVMTCESAVQLRKAGKVTVRESNLKKLGATHFKYGVVDE 112 

TremaHb. {d) LKDSPIPLEQNPKLKPHAMTVFVMTCESAVQFRKAGKVTVRESNLKRIGAIHFKNGVVHE 113 

ParaspHb. (d) L KDS P VPL EQN P KL KPHATTV FVMT C E S A VQLRKAG KVTVKES DLKR I G A I HF KTG WNE 113 

AraHbl. (d) LRDSPIPAEQNPKLKPHAMSVFVMCCESAVQLRKTGKVTVRETTLKRLGASHSKYGVVDE 112 

WheatHbl. (m) LRDSDVPLETNPKLKTHAVSVFVMTCEAAAQLRKAGKITVRETTLKRLGGTHLKYGVADG 113 

BarleyHb. (m) LRDSDVPLETNPKLKTHAVSVFVMTCEAAAQLRKAGKITVRETTLKRLGGTHLKYGVADG 113 

MaizeHb. (m) LRDSDVPLEKNPICLKTHAMSVFVMTCEAAAQLRKAGKVTVRETTLKRLGATHLRYGVADG 116 

RiceHbl. (m) LRNSDVPLEKNPKLKTHAMSVF\^TCFJ^QLRKAGKVTVRDTTLKRLGATHLKYGVGDA 117 

RiceHb2.(m) LRNSDVPLEKNPKLKTHAMSVFVMTCEAAAQLRKAGKVTVRDTTLKRLGATHFKYGVGDA 120 



. * * * * + 



PotatoHb. (d) HFEVTKYALLET I KEAS P - EMWS VEMKNAWGEAYDQLVSAI KTEMK 152 

TomatoHb. (d) HFEVTKYALLET I KEASQ- EMWS VEMKNAWGEAYDQLVSAI KTEMK 152 

CasuarHb2 . (d) HFEVTKFALLETI KEAVP - ETWS PEMKNAWGEAYDKLVAAI KLEMKPSS 160 

SoybeanHb. (d) HFEVTKFALLETI KEAVP - EMWSPAMKNAWGEAYDQLVDAI KSEMKPPSS - - 161 

CottonHb. {d} HFEVTKFALLETI KEAVP - DMWSDEMKNAWGEAYDRLVAAI KIEMKACSQAA 163 

TremaHb. (d) HFEVTRFALLETIKEAVP-EMWSPEMKNAWGEAYDQLVAAIKFEVKPSST- - 162 

ParaspHb. (d) HFEVTRFALLETI KEAVP - EMWS PEMKNAWGVAYDQLVAAI KFEMKPSST- - 162 

AraHbl. (d) HFEVAKYALLETI KEAVP - EMWS PEMKVAWGQAYDHLVAAI KAEMNLSN 160 

WheatHbl. (m) HFEVTRFALLETI KEALPADMWGPEMRNAWGEAYDQLVAAIKQEMKPSE 162 

BarleyHb. (m) HFEVTRFALLETI KEALPADMWGPEMRNAWGEAYDQLVAAIKQEMKPAE 162 

MaizeHb. (m) HFEVTGFALLETIKEALPADMWSLEMKKAWAEAYSQLVAAIKREMKPDA 16 5 

RiceHbl. (tn) HFEWKFALLDTI KEEVPADMWS PAMKSAWSEAYDHLVAAI KQEMKPAE 166 

RiceHb2. (m) 



HFEVTRFALLETI KEAVPVDMWS PAMKS AWS EAYNQLVAAI KQEMKPAE 169 



**-**. . * ** . 



. ** . .**.**★ 



Fig. 1. Multiple alignment of TaHbl and StHb with hemoglobins from other plant species obtained by Clustal W. The numbers represent the position of the 
amino acids in the respective protein sequences. Identical amino acid residues in all sequences are indicated by asterisks. Semicolon indicates residues that arc 
mostly conserved (identified in at least 8 out of 13 amino acids). Dashes within a sequence indicate gaps inserted to optimize alignment. The accession numbers 
of the sequences used in the comparison are: potatoHb, S. tuberosum, AY 15 1389; tomatoHb, L. esculentum, AY026343; CasuarinaHb2, C glauca, X53950; 
SoybeanHb, G. max, U47143; CottonHb, G. hirsutum, AF329368; TremaHb, T. orientalis, Z99635; ParasponiaHb, P. andersonii, U27194, U94998; WheatHb, 
T. aestivum, AY151390; BarleyHb, H. vulgare, U94968; MaizeHb, Z mays, AF236080; RiceHbl, O. sativa, AF335504; RiceHb2, Osativa, U7603I. 
Alignments of sequences were performed using the Clustal W program on EBI WWW molecular biology server. 



tional start site is in accordance with the plant consensus 
having a C at position + 5. No putative poly(A + ) signal is 
observed due to lack of complete 3'UTR. The G + C content 
of the coding region is typical for a graminaceous monocot 
52% [24], and it codes for a polypeptide of 162 amino acids 
with an estimated molecular mass of 18.1 kDa and pi of 8.7 
(calculated using the sequence analysis tools of http:// 
www.expasy.org/tools/dna.htmI). 

The cDNA of the other isolated wheat Hb gene, TaHb2 y 
is 541 -bp long with the translational start site at nucleotide 
16 and the TAA stop site at nucleotide 531. The ORF of 
TaHb2 has a G + C content of 56% and it encodes a 
polypeptide of 171 amino acids with an estimated molecular 
mass of 19.5 kDa and pi of 5.6. The deduced amino acid 
sequence is shown in Fig. 2. The nomenclature 2-on-2 is 



referring to the arrangement of the a-helices in the three- 
dimensional structure of these Hbs. 

The cDNA of the potato Hb gene (StHb) is 646-bp long 
with the translational start site at nucleotide 22 and the TAG 
stop at nucleotide 480. The ORF of StHb has a G + C 
content of 40% and it encodes a polypeptide of 152 amino 
acids with an estimated molecular mass of 17.1 kDa and pi 
of 8.9. The translational start of the StHb cDNA matches the 
plant consensus having an A at position - 3. The lengths of 
most published Hb cDNAs are in the range of 600-900 bp 
and hence the TaHbl, TaHb2 and StHb cDNAs do not 
represent fiill-length cDNA clones. Obviously, both 5'UTR 
and 3'UTR sequences are missing. 

Amino acid sequence similarity between TaHbl, StHb 
and other published plant Hbs was determined by the Clustal 
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TaHb2 .2-on-2 MQSLQDKASEWSGVAAADAFAIDEVNVFEALGGTPQPFVDLSTNFYTRVYEDEEEWFPEI 60 
HvHb.2-on2 MQSLQDKASEWSGVAAADAFAIDEVNVFEALGGTPQPFVDLSTNFYTRVYEDEEEWFREI 60 

AtHb.2-on-2 MQSLQDKASVLSGVDQAEAFAIDESNLFDKLG- -LQTFINLSTNFYTRVYDDEEEWFQSI 58 



********* *** 



****** * * ** * * 



********* ****** * 



TaHb2 . 2-on-2 FSGSRKEDAIQNQYEFLVQRMGGPPLFSQRRGHPALIGRHRPFPVTHQAAERWLHHMQQA 120 
HvHb . 2 -on- 2 F SGS KKEDA I QNQ Y E FL VQRMGG P P L FSQRRGH PAL I GRH R P F P VTHRAAERWLHHMQQA 120 

AtHb.2-on-2 F SNSNKED AI QNQ YEFF VQRMGGP P L Y S QR KGH P AL I GRHR P F P VTHQ AAERWLEHMQNA 118 

** * *********** *********;***;**************** ****** *** * 

TaHb2.2-on-2 LETTES INPDTKTKMMI FFRHTAYFLVAGNEMTRQTQSVP PCKHATSKPAE- 171 

HvHb. 2 -on -2 LETTQS I NPDTKTKMMN FFRHTAYFLVAGNEMTRQTQSVP PCKHATNKPAE- 171 

AtHb.2-on-2 LDDSVDIDQDSKIKMMKFFRHTAFFLVAGNELKNQNEKPKHKPQCACKPAANKPAEE 175 

* * * * *** ****** ******* * ** * **** 

Fig. 2. Alignment of 2-on-2 plant hemoglobins obtained by Clustal W. The numbers indicate the position of the amino acids in the respective protein sequences. 
Identical amino acid residues in all three sequences are indicated by an asterisk. Dashes within a sequence indicate gaps inserted to o ptimize ali gnment. The 
accession numbers of the sequences used in the comparison arc: TaHb2.2-on-2, T. aestivum, AY 1 5 1 39 1 ; HvHb.2-on-2, K vulgare, AF376063; AtHb.2-on-2, A. 
thaliana, AF376062. Alignment of sequences was performed using the Clustal W program on EBI WWW molecular biology server. 



method. A substantial degree of identity between TaHbl, 
StHb and most other Hb was found. The encoded TaHbl 
polypeptide exhibits significant sequence similarity to Hbs 
from different plants with highest identity to Hbs from 
Hordeiim vulgare (98%) and Zea mays (83%). Generally, 
the highest similarity values are exhibited between TaHbl 
and Hbs from other monocot species. The lowest amino acid 
identity between TaHbl and other Hbs was, not surprisingly, 
to Hbs from dicot species as Citrus unshiu (64%) and A. 
thaliana (66%). The StHb amino acid sequence showed 
highest identity to Hbs from the dicots to tomato (Lycopersi- 
icon esculaentum), which like potato is a member of Sol- 
anaceae, and to Gyssopium hirsutum with values of 97% and 
80%, respectively. TaHb2 exhibits high sequence similarity 
to 2-on-2 Hbs identified from different plants with highest 
identity (96%) to GLB3 from H. vulgare and GLB3 from Z 
mays (88%) and also to A. thaliana 2-on-2 Hb (71%). 

Multiple alignment of TaHbl, StHb and other plant 
nonsymbiotic Hb sequences (Fig. 1) revealed several 
regions of high homology. The highly conserved phenyl- 
alanine residue found in many known plant nonsymbiotic 
Hbs is also present in TaHbl and StHb in positions F51 and 
F45, respectively. A cysteine residue found in nonsymbiotic 
Hbs is also present in TaHbl (C79) and StHb (C73). With 
the exception of N at position 64, the TaHbl contains the 
signature of plant Hbs (S-P-X-L-X-X-H-A-X-X-X-F). 
Sequence comparison also shows that wheat Hbl contains 
distal (H70) and proximal (HI 05) histidine residues, as well 
as the P45, F51, F75 and Fl 15 that are conserved in plant 
Hbs (Fig. 1). TaHbl also contains a single Cys residue, C79, 
that is highly conserved in nonlegume Hbs. With the 
exception of N and L at positions 58 and 69, respectively, 
the StHb contains the consensus of plant Hbs (S-P-X-L-X- 
X-H-A-X-X-X-F) as well as the distal (H64) and proximal 
(H99) histidine residues, P39 and F109, also conserved in 
plant Hbs. The highly conserved F69 of known plant Hbs is 
a leucine residue in StHb. A motif (N-P-X-L-X-X-H-A-X- 
X-X-F) is conserved in the both encoded TaHbl and StHb 
amino acid sequences. The asparagine residue in the amino 
terminal of this sequence seems to be conserved in non- 
symbiotic Hbs. 



An alignment of TaHb2 with 2-on-2 Hbs from barley and 
Arabidopsis is shown in Fig. 2. The alignment reveals a 
remarkably high homology between the wheat and barley 
sequences and a significantly lower homology to the Arabi- 
dopsis 2-on-2 Hb. A phylogenetic analysis of the four 
different 2-on-2 Hbs demonstrated that barley, wheat and 
maize group together and different from the Arabidopsis 
GLB3 (data not shown). 

The hydropathy profiles of the predicted TaHbl and StHb 
proteins were determined by computer analysis using the 
protscale program (http://www.expasy.ch/cgi-bin/protscale). 
The hydropathy analysis of TaHbl and StHb (data not 
presented) showed that both have a similar profile to other 
plant Hbs. This suggests that the predicted tertiary structure 
of TaHbl and StHb is predominantly formed by ot-helices. 
The results showed that the hydropathy profiles of TaHbl 
and StHb are very similar to that of recombinant rice Hbl 
indicating that the tertiary structure of TaHbl and StHb is 
identical to that of rice Hbl. The crystal structure of a 
nonsymbiotic rice Hb has recently been reported and the 
elucidated tertiary structure of the rice Hbl demonstrated it 
to be a dimer [21]. 

In an effort to determine the evolutionary relationships 
between members of the Hbs, a phylogenetic tree was 
generated using computer software (MEGAL1GN program, 
Clustal method). A phylogenetic analysis of 13 different 
plant Hbs has been carried out (Fig. 3). As seen in the 
evolutionary tree of the amino acid sequences, Hbs separate 
into four groups reflecting their evolution. The phylogenetic 
analysis shows that TaHbl clusters with other monocot Hbs 
and forms a group distinct from nonsymbiotic dicot Hbs. 
Three distinct groups contain all dicot Hbs except cotton 
Hb. The wheat Hbl clearly belongs to the monocotyledo- 
nous clade clustering together with barley and maize Hb and 
groups individually and different from the rice Hbl and 
Hb2. The potato Hb is found together with tomato Hb but, 
surprisingly, groups with all the monocot nonsymbiotic Hb 
polypeptide sequences. It is noteworthy that all other dicot 
Hbs fall into another side of the root and that the cotton Hb 
seems to belong to a third distinct group of nonsymbiotic 
Hbs. 
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Potato Hb. 
Tomato Hb. 



"L-Bar 



Wheat Hbi. 



Barley Hb. 
Maize Hb. 



-Rice Hb1. 



-Rice Hb2. 



-Trema Hb. 
Parasp Hb. 



-Ara Hb1. 



- Cotton Hb. 

Casuar Hb2. 

Soybean Hb. 



Fig. 3. Phylogcnctic tree of TaHbl, StHb and other plant Hbs. The tree was constructed using the using the clustal method of DNASTAR Megalign 
(DNASTAR, Inc., Madison, WI) based on amino acid similarities of the full sequences. The accession numbers of the sequences used for construction of the 
phylogcnctic tree arc listed in the legend to Fig. 2. 



To determine the spatial expression of the Hb genes 
isolated from wheat, TaHbl and TaHb2, semiquantitative 
RT-PCR analysis was carried out with total RNA extracted 
from root, stem, seminal leaf, leaf and flowers of wheat. As 
illustrated in Fig. 4 A by a strong fluorescent 516-bp band, 
the RT-PCR analysis demonstrated a very high expression 
level of TaHbl in wheat root tissue and a very low level of 
expression in cotyledons. No TaHbl transcript could be 
detected in stems, leaves and flowers of wheat. The RT-PCR 
analysis also revealed a TaHb2 transcript of 541 bp in all 
organs of wheat, the level of expression varying between 




TaHbl 



TaHb2 



organs (Fig. 4B). TaHbl transcript is more abundant in 
flower, stems and leaves than in roots. 

A constitutive expression was also demonstrated for 
another 2-on-2 Hb, GLB3 from A. thaliana with a slightly 
increased expression in roots [22]. In leaves, transcript 
abundance increases with maturity as almost no TaHb2 
transcript is detected in seminal leaf. The gene-specific 
primers for ryegrass glyceraldehyde-3 -phosphate dehydro- 
genase (GAPDH) were used as internal controls in order to 
assure that equal amounts of cDNA were used in each 
individual RT-PCR experiment. A 380-bp band representing 
the wheat GAPDH mRNA revealed an expression pattern 
with equally intense bands, indicating that equal amounts of 
wheat RNA have been used. 

The effect of oxygen concentration on the expression of 
wheat Hbs and GAPDH was investigated by RT-PCR. 
Twelve-day-old wheat seedlings were incubated under 16 
h of controlled oxygen tensions. Marmalade jars were 
flushed with high purity N 2 for 10 min and the 0 2 
concentration was adjusted to appropriate concentrations. 
The oxygen concentrations were measured at t 0 = 0 h and 
/= 16 h using a gas chromatograph (Varian Star 3400CX). 
A flooding experiment was also conducted with wheat 
seedlings where the entire root system was completely 
flooded for 16 h. Control wheat plants were grown on 
wetted filter paper. Total RNA was isolated from root and 



I GAPDH 




StHb 600bp 



GAPDH 380bp 



Fig. 4. RT-PCR expression analysis of TaHbl, TaHb2 and StHb. The sizes 
of the TaHbl, TaHb2 and StHb and GAPDH fragments were 516, 541, 646 
and 380 bp, respectively. 
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Fig. 5. RT-PCR. expression analysis of TaHbl in wheat roots exposed to 
different oxygen concentrations. 
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leaf of the wheat seedlings as described above and 1 |ig 
was used in an RT-PCR reaction with the same composition 
as described earlier in this paragraph. The results presented 
in Fig. 5 clearly demonstrated that the expression of TaHbl 
is induced in wheat roots exposed to anaerobic conditions 
and that the highest expression level was obtained with an 
oxygen concentration of 0.5% and a slightly lower expres- 
sion at 2.5%. TaHbl is clearly induced by hypoxia and 
hence it can be grouped together with class I Hbs. The 
GLB1 (earlier nomenclature Ahbl) from Arabidopsis and 
other class I Hbs display similar behaviour [5,17,25], 
Surprisingly, the roots of flooding-stressed wheat plants 
did not show any increased level of TaHbl transcript 
compared to roots from normally grown plants. The lacking 
response in expression of TaHbl in flooded roots can 
possibly be explained by the experimental design where 
only the roots were flooded. Either oxygen diffusion in the 
water covering the roots or oxygen supplied to the roots 
from the aerial part of the wheat seedling could have 
created a higher oxygen tension than intended. The expres- 
sion of both TaHb2 and GAPDH was unaffected of the 
different anaerobic conditions. The expression of potato 
Hb, StHb, was determined by semiquantitative RT-PCR of 
total RNA extracted from potato tuber, root, stem and leaf 
with an expected amplification product of 646 bp As 
illustrated in Fig. 4C, a high StHb expression level was 
detected in potato root tissue and also in tuber tissue 
although at a reduced level. Avery faint signal representing 
a low level of potato Hb transcript was observed in stem 
tissue. No StHb transcript could be detected in potato leaf 
tissue. RT-PCR experiments were also carried out without 
reverse transcriptase and no fluorescent bands were 
detected, indicating no contamination with genomic DNA 
in the RNA preparations (data not shown). The gene- 
specific primers for ryegrass GAPDH were used as internal 
controls in order to assure that equal amounts of cDNA 
were used in each individual RT-PCR experiment. A 380- 
bp band representing the GAPDH mRNA was represented 
in root, stem and leaf tissue of potato at the same fluo- 
rescence intensity, indicating that equal amounts of cDNA 
were used. 

The cloning of three different cDNAs encoding nonsym- 
biotic Hbs is reported here. Two Hbs were isolated from 
wheat and one Hb from potato. The identity of the Hb 
cDNAs was demonstrated by comparison with peptide 
sequence data from other known plant Hbs. Functional 
expression of TaHbl, TaHb2 and StHb as recombinant 
proteins is needed for final confirmation of the identity of 
the Hbs and to verify that the recombinant versions possess 
the functionality characteristic of Hbs. The data obtained 
from this work suggest that TaHbl is important for the 
development of normal wheat tissues. Under conditions of 
hypoxia, increased Hbl synthesis in wheat does not appear 
to be a general response. Rather, the response may be 
confined to tissues stressed for energy supply and for 
oxygen limitations. 
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duplication; We isolated and sequenced two haemoglobin genes from the early-branching 

nonsynonymous substitution rate; angiosperm Euryale ferox (Nymphaeaceae). The two genes belong to the two 

Nymphaeaceae; known classes of plant haemoglobin. Their existence in Nymphaeaceae 

plant haemoglobin; supports the theory that class 1 haemoglobin was ancestrally present in all 

symbiosis. angiosperms, and is evidence for class 2 haemoglobin being widely distributed. 

These sequences allowed us to unambiguously root the angiosperm haemo- 
globin phylogeny, and to corroborate the hypothesis that the class 1 /class 2 
duplication event occurred before the divergence between monocots and 
eudicots. We addressed the molecular evolution of plant haemoglobin by 
comparing the synonymous and nonsynonymous substitution rates in various 
groups of genes. Class 2 haemoglobin genes of legumes (functionally involved 
in a symbiosis with nitrogen-fixing bacteria) show a higher nonsynonymous 
substitution rate than class 1 (nonsymbiotic) haemoglobin genes. This suggests 
that a change in the selective forces applying to plant haemoglobins has 
occurred during the evolutionary history of this gene family, potentially in 
relation with the evolution of symbiosis. 



Introduction 

Two types of haemoglobin, called class 1 and class 2 
haemoglobin, occur in vascular plants. They are distinct 
from each other through their function and expression 
pattern (Jacobsen-Lyon et al, 1995; Trevaskis et al, 1997; 
Hunt et al, 2001). Class 1 haemoglobins have been found 
in both monocots and eudicots and show a conserved 
expression pattern (Hunt et ai, 2001). They are expressed 
in germinating seeds [barley (Duff et ai, 1998)], in roots 
of mature plants {Parasponia, Trema and Casuarina GLB1 
(Bogusz et al, 1988; Bogusz et al, 1990; Jacobsen-Lyon 
et al, 1995; Andersson et al, 1997; Franche et al, 1998) 
and rosette leaves subjected to hypoxic conditions, and in 
response to sucrose [Arabidopsis (Trevaskis et al, 1997)]. 
Class 2 haemoglobins were first . described in plants 
achieving symbiosis with nitrogen-fixing bacteria (sym- 
biotic haemoglobins). In most legume species (symbiosis 
with Rhizobium and Bradyrhizobium) and in several 
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species from Fagales, Rosales and Cucurbitales (symbiosis 
with Frankia), class 2 haemoglobins are expressed at high 
concentration in nodules and facilitate oxygen diffusion 
to the nitrogen-fixing symbionts (Appleby, 1992). Class 2 
haemoglobins were also found in several nonsymbiotic 
eudicots (nonsymbiotic haemoglobins), where their 
function remains unclear. In these species, they 
are expressed in several tissues (Szabados et al, 1990; 
Strozycki et al, 2000), and might participate in embry- 
ogenesis and seed maturation [Cichorium (Hendriks et al, 
1998), Arabidopsis (Hunt et al, 2001)]. At this moment no 
class 2 haemoglobin gene has been found in any 
monocot. 

Expression patterns, therefore, suggest that the evolu- 
tion of haemoglobin may have played a role in the 
evolution of symbiosis between angiosperms and nitro- 
gen-fixing bacteria, stimulating research about the 
molecular phylogeny and evolution of this gene family. 
Hypotheses about plant haemoglobin evolution have 
gradually progressed thanks to the characterization of 
new sequences (Andersson et al, 1996; Trevaskis et al, 
1997; Arredondo-Peter et al, 1998; Strozycki et al, 
2000). We are a long way from the first postulate of a 
horizontal transfer of the haemoglobin gene from the 
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animal kingdom to a progenitor of the dicotyledonous 
angiosperms. Molecular phylogeny analyses support the 
grouping of class 1 vs. class 2 haemoglobins (Andersson 
et al, 1996; Trevaskis et al, 1997; Arredondo-Peter et al, 
1998; Strozycki et al, 2000), the two clusters being 
separated by a long internal branch. The recent identi- 
fication of haemoglobin genes in nonvascular plants 
(liverwort Marchantia polymorpha (Hunt et al, 2001) and 
mosses Physcomitrella patens (Arredondo-Peter et al, 
2000) and Ceratodon purpureus (accession number 
AF309562) provides a way to root the angiosperm 
haemoglobin phylogeny, potentially casting light on the 
evolutionary history of this gene family. According to 
Hunt et al. (2001), the root of the tree takes place 
between the class 1 and class 2 clusters (Fig. la), 
suggesting that class 1 and class 2 haemoglobins arose 
through an ancient duplication event that occurred 
before the divergence between monocots and eudicots. 
This hypothesis requires that class 2 genes have been lost 
in the monocot lineage. 

Nonvascular plants, however, are quite divergent from 
angiosperms (Soltis et al, 1999), casting doubts on the 
location of the root of the haemoglobin tree. The so- 
called 'long branch attraction' artefact (Felsenstein, 
1978) may be responsible for the moss sequences being 
connected to the long branch that separates class 1 from 
class 2 haemoglobins, rather than to any other branch in 
the un-rooted tree. An alternative scenario involving a 
eudicot-specific duplication event, and paraphyletic class 
1 haemoglobins, would be more parsimonious with 
respect to gene gain/loss (Fig. lb). 

Also, the published haemoglobin trees suggest that the 
rate of amino-acid substitution varies between lineages 
(Strozycki &- Legocki, 1995; Strozycki et al, 2000). The 
divergence between class 2 haemoglobins from various 
legume species, for example, appears comparable with 
the divergence between monocot and eudicot class 1 
genes despite a more restricted taxonomic range, sug- 
gesting a possible acceleration of evolutionary rate in the 



class 2 lineage. This acceleration, if confirmed, would be 
an interesting evolutionary feature of the class 2 symbi- 
otic genes. It might, however, contribute to possible 
phylogenetic artefact. 

These arguments emphasize the need for a solid 
rooting of the haemoglobin tree. This would help in 
clarifying the phylogenetic history of the gene family in 
symbiotic and nonsymbiotic plants. Furthermore, the 
measurement of rate variation between lineages being 
highly dependent on the assumed phylogeny and root- 
ing, this would also allow assessing the extent of 
evolutionary rate variation between lineages, possibly 
connecting features of the molecular evolutionary pro- 
cess in haemoglobin to the evolution of symbiosis. In this 
study, we made use of Nymphaeaceae Euryale ferox, a 
deeply-branching lineage of the angiosperm tree, to root 
the haemoglobin tree, and reconsider the molecular 
evolution of plant haemoglobins. 

Material and methods 

Young leaves from the Nymphaeaceae E. ferox were 
collected in the botanical garden of Montpellier (France). 
DNA extraction was performed with the DNeasy Plant Kit 
(QIAGEN, Courtaboeuf, France). 

Degenerate primers for polymerase chain reaction 
(PCR) were defined from aligned plant haemoglobin 
sequences available in GenBank, and used to amplify 
each haemoglobin gene in two overlapping regions. Class 
1 primers are: TTCAGYGARGAGCARGARGC (exon 1 
forward)/GTGGWCTCCCTCACSGTRAC (exon3 reverse), 
and GACTCCGACCTGCCACTTGAAC (exon 2 forward)/ 
GCRGCBACCARYTGRTCRTARGC (exon 4 reverse). 
Class 2 primers are: TTCAGYGARGAGCARGARGC (exon 
1 . forward) /GTRGCRCCYAGYCTYTTC AG (exon 3 
reverse), and AACCCSMARCTCARGYCYCAT (exon 2 
forward)/GCMRCCAACTKRTCRTARGC (exon 4 re- 
verse). The PCR reactions included Taq DNA polymerase 
buffer (Promega)/1.5 him MgCl2/200 yaA each dNTP/ 




Class II eudicots (nS and S) 

Class II nionocols (nS) 
Class I eudicots (nS) 

Class I monocots (nS) 




Class II eudicots (nS and S) 



Class 1 eudicots (nS) 



Class I monocots (nS) 



Fig. 1 Evolutionary history of plant haemoglobin genes: competing hypothesis, (a) A gene duplication gave rise to class 1 and class 2 
haemoglobins before the divergence between monocots and eudicots. Class 2 haemoglobins have been lost (or are still undetected) in cereals, 
(b) A gene duplication occurred in the eudicot lineage - black dot: gene duplication; gray dot: monocots/eudicots divergence; S: symbiotic gene; 
nS: nonsymbiotic gene. 
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variable quantities of each primer, depending on their 
degeneracy /genomic DNA/5 units Taq DNA polymerase 
(Promega) in a total volume of 50 fit. The reaction 
consists of a first step at 94 °C for 2 min, followed by 30 
cycles with a denaturation step at 91 °C for 30 s, an 
annealing step at variable temperature (from 55 °0 to 
62 °C) for 20 s and an extension step at 72 °C for 30 s. 

The amplified fragments, purified with the GenElute 
PCR DNA purification kit (Sigma-Aldrich, Saint Quentin 
Fallavier, France), were cloned with the pGEM-T Easy 
Vector System (Promega). For each amplified fragment 
four to six clones were sequenced using the ThermoS- 
equenase sequencing kit with dye primer (Amersham 
Pharmacia Biosciences, Orsay, France) following the 
provider's instructions. The sequences were analysed 
with the ALFwin Sequence Analyser (Amersham Phar- 
macia Biosciences). 

Phylogenetic analyses were performed on 52 plant 
haemoglobin amino-acid sequences (143 sites, 132 sites 
with gaps removing, 121 variable sites, 110 informative 
sites). We used methods of maximum likelihood [using 
the JTT model (Jones et al, 1992)] with the program 
NJML+ (Ota & Li, 2001), and Neighbour- Joining (NJ, 
Saitou & Nei, 1987) with the program PHYLO_WIN 
(Galtier et al, 1996), with 1000 bootstrap replicates for 
each analysis. Phylogenetic analysis based on Bayesian 
inference was also performed with MrBayes program 
(Huelsenbeck & Ronquist, 2001). We used the JTT 
model assuming a discretized gamma distribution with 
four classes. 120 000 Monte Carlo Markov chain 
(MCMC) steps were performed using four Markov 
chains running at different temperatures. The 20 000 
first steps were discarded (burnin). Trees were sampled 
every 100 steps, and a consensus tree (out of 1000 
sampled trees) was constructed. Clade posterior prob- 
abilities allow evaluation of the robustness of each 
node. 

Pairwise distances between protein sequences were 
calculated using the PAM correction for multiple sub- 
stitutions (Dayhoff et al 1978), as implemented in 
program PROTDIST from package PHYLIP (Felsenstein 
1989). These distances, used to build the NJ tree, were 
also used to examine sequence similarities between the 
newly isolated sequences of E. ferox and others plant 
haemoglobin genes. 

The heterogeneity of evolution rate between lineages 
was measured using a generalization of the relative-rate 
test (Wu & Li, 1985) to more than three species 
(Robinson-Rechavi et al, 1998; Robinson-Rechavi & 
Huchon, 2000). The synonymous (Ks) and nonsynony- 
mous (Ka) evolutionary distances were estimated for 
each sequence pair within the group of eudicot class 1 
genes (11 genes from eight different species, 55 pairs), 
the group of monocot class 1 genes (four genes from 
three different species, six pairs), and the group of class 2 
(symbiotic) genes of legumes (27 genes from 1 1 different 
species, 351 pairs). We used the method of Yang & 



Nielsen (2000) as implemented in program ynOO of 
PAML (Yang, 1997). 

Results and discussion 

Two haemoglobin sequences in Nymphaeaceae 
E. ferox 

All the fragments produced by degenerate amplifications 
on the E. ferox genomic DNA were cloned and sequenced. 
Two sequences (called Hbnl and Hbn2) showed signifi- 
cant similarity to known haemoglobin sequences. Hbn2 
shows three introns and four exons in positions identical 
to those of other plant haemoglobins (Jensen et al, 1981; 
Landsmann et al, 1986). Concerning Hbnl, amino-acid 
sequence similarity, to known haemoglobin sequences 
was recovered when a nonstandard intron-exon map 
was used: exon 3 appears 42 nucleotides longer (and 
exon 4, 42 nucleotides shorter) than in other plant 
haemoglobin genes. PAM distances on amino acid 
sequences were estimated between each of these two 
new genes and all others known sequences. Hbnl is 
much closer to the class 1 haemoglobins (average PAM 
distance: 0.41 replacements per site) than to class 2 
nonsymbiotic (0.86 rep. per site) and symbiotic (1.19 rep. 
per site) haemoglobins. In contrast, Hbn2 is more similar 
to class 2 nonsymbiotic genes (0.76 rep. per site) than to 
class 1 (0.90 rep. per site) and symbiotic (1.05 rep. per 
site) haemoglobins. These results show that at least two 
haemoglobin genes actually exist in E. ferox, presumably 
belonging to class 1 (Hbnl) and class 2 (Hbn2), respect- 
ively. 

Figure 2 presents the phylogenetic tree of 52 plant 
haemoglobin genes, constructed from 132 amino-acid 
sites with phylogenetic analysis based on Bayesian 
inference. Both groups of class 1 and class 2 genes are 
supported by high clade posterior probabilities (0.97 and 
1.00 respectively). Phylogenetic reconstructions obtained 
using the NJ and maximum of likelihood methods are 
congruent with the Bayesian tree in that they present 
two strongly supported clades composed of class 1 genes 
(including Hbnl), and class 2 genes (including Hbn2) 
respectively. Note that both the class 1 and class 2 genes 
of E. ferox branch out as the earliest diverging lineages, in 
agreement with recent angiosperm molecular phyloge- 
nies (Soltis et al, 1999; Soltis et al, 2000). These results 
unambiguously confirm the previous rooting of the 
haemoglobin tree (Hunt et al, 2001), i.e. the gene 
duplication that gave rise to class 1 and class 2 haemo- 
globins is older than the split between monocots and 
eudicots. Thus, class 2 haemoglobins must have been lost 
(or are still undetected) in cereals, and more research is 
needed to find out whether the lack of class 2 haemo- 
globin is restricted to cereals, or general to all monocots. 

The unambiguous rooting of the tree gives an oppor- 
tunity to compare molecular evolutionary rates between 
haemoglobin lineages. 
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Fig. 2 Bayesian tree: 52 species, 132 amino 
acid sites. Clade posterior probabilities are 
given for each node. Neighbour- Joining 
bootstrap supports (1000 replicates) are 
indicated within parenthesis, lhb: symbiotic 
haemoglobin of legumes; S: nonlegume 
symbiotic gene; nS: nonsymbiotic gene. 
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Evolutionary rates of plant haemoglobins 

As long as it is assumed that the right topology has been 
inferred, the estimation of the branch lengths of a 
phylogeny may provide useful information about 
molecular evolutionary processes. In the case of the 
plant haemoglobin phylogeny, several nodes remain 
unresolved within each group of genes (Fig. 2). How- 
ever, at this time we have sufficiently reliable knowledge 
of angiosperm phylogeny (Soltis et ah, 1999; Soltis et ah, 
2000), which can help to resolve some of these ambigu- 
ities. We thus modified the haemoglobin gene phylogeny 
to make it in agreement with the canonical angiosperm 
phylogeny, grouping together genes belonging to the 
same family or order. No change was made within the 
legume class 2 group, in which many recent duplication 
events have occurred, confusing species phylogeny. 
Branch lengths of the haemoglobin gene tree were then 
estimated by fitting PAM distances to the modified 



topology (least square fit). This 'true' haemoglobin 
phylogeny (Fig. 3) shows longer branches for the class 
2 sequences, and especially for symbiotic genes. The level 
of significance of this trend was assessed using a gener- 
alization of the relative-rate test to more than three 
species (Robinson-Rechavi et ah, 1998). The difference of 
amino-acid substitution rates between class 1 and class 2 
genes, using moss sequences as out-group, was found to 
be highly significant (P < 0.001). 

The acceleration observed for class 2 genes might have 
been caused by an increase of mutation rate (e.g. if 
located in a rapidly evolving genomic region). Alternat- 
ively, this pattern might reflect a change in the selective 
forces applying to the haemoglobin gene. To distinguish 
between these two hypotheses, we contrasted the syn- 
onymous (Ks) and nonsynonymous (Ka) evolutionary 
rates for each sequence pair within two groups of class 1 
genes (eudicots and monocots respectively) on one hand, 
and the group of symbiotic class 2 genes of legumes on 
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Fig. 3 Haemoglobin gene phylogeny: 
52 species, 132 amino acid sites. Branch 
lengths were estimated by fitting PAM 
distances between amino-acid sequences to a 
modified haemoglobin tree topology (see 
text). 



the other hand. Under the hypothesis of a variable 
mutation rate but a constant selective regime, an equal 
Ks/Ka ratio would be expected in all groups. Figure 4 
leads to rejection of this hypothesis: the Ks/Ka ratio is 
much lower in the class 2 symbiotic group (average: 4.6) 
than in class 1 groups (average: 9.3 for eudicots and 12.5 
for monocots). This indicates that the selective forces 
applying (or having applied) to symbiotic class 2 haemo- 
globins are distinct from those applying to class 1 
haemoglobins. 

These within-group comparisons do not reflect the 
evolutionary process that accompanied the functional 
divergence of the two genes after the duplication, but 
rather what occurred after their current function was 
acquired. Again, two main hypotheses might be invoked 
to explain the observed difference in Ks/Ka ratio between 
the two lineages. First, a lower Ks/Ka in class 2 genes 
might result from the relaxation of functional 



constraints, i.e. a decrease of the strength of purifying 
selection. Amino acids might be freer to vary in class 2 
than in class 1 genes, for a yet undetermined reason. This 
is the simplest explanation, consistent with the neutralist 
theory, and should be considered as the null hypothesis. 
Alternatively, recurrent adaptation might also explain an 
increase of nonsynonymous substitution rate in class 2 
symbiotic genes. Advantageous (nonsynonymous) muta- 
tions have a higher fixation probability than neutral 
mutations, and contribute to decreasing the Ks/Ka ratio. 

The reason why the adaptive hypothesis could be 
considered as a reasonable alternative to the neutralist 
explanation in this case is that the legume haemoglobins 
are functionally involved in the symbiosis between 
legumes and Rhizobium. Interactions between species 
potentially generate the need for a recurrent adaptation 
(Red-Queen-like evolution), one species having to adapt 
to the innovations found by the other one. Nitrogen 
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Fig. 4 Distribution of pairwise Ks/Ka ratios for legume symbiotic 
genes (a), eudicots class 1 genes (b) and monocots class 1 genes (c). 
Within each group, synonymous and nonsynonymous distances 
were estimated for all sequence pairs. The vertical lines indicate the 
average values. 

fixation by bacteria, for instance, involves the nitroge- 
nase enzyme, which requires substantial amounts of 
energy (i.e. ATP) produced by bacteroid respiration. 
However, the oxygen necessary for respiration readily 
inhibits the activity of nitrogenase. Haemoglobin thus 
appears to be a key factor in the process of symbiotic 
exchange by facilitating oxygen diffusion to the nitrogen- 
fixing symbionts (Appleby, 1992). From an evolutionary 
perspective, increasing or decreasing the affinity of 
haemoglobin to oxygen might therefore be a way for 



legumes to control the bacterial growth and amount of 
symbiotic exchange or to adapt to a new strain of bacteria 
(Bever & Simms, 2000; Denison, 2000; West et ah, 2002). 

Adaptation at the sequence level can definitely be 
invoked when Ka is higher than Ks, a pattern incompat- 
ible with neutral evolution or purifying selection. This is 
not what we observed for haemoglobin. Not observing 
Ka/Ks, however, does not imply that adaptation has not 
occurred. Indeed, this adaptive evolution might have 
involved only a (small) fraction of sites, and/or have 
occurred in specific branches of the tree. Ks/Ka measure- 
ments averaged over sites and over lineages make the 
detection of episodes of adaptive evolution difficult, and 
additional analyses involving models allowing for vari- 
able Ks/Ka ratio among sites/branches (Goldman & Yang, 
1994; Yang, 1997; Yang, 2000) are required for distin- 
guishing between the two hypotheses. 
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